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During a germinal center reaction, random mutations are introduced into immunoglobulin V 
genes to increase the affinity of antibody molecules and to further diversify the B cell reper- 
toire. Antigen-directed selection of B cell clones that generate high affinity surface Ig results In 
the affinity maturation of the antibody response The mutations of Ig genes are typically base- 
pair substitutions, although DNA insertions and deletions have been reported to occur at a low 
frequency. In this study, we describe five Insertion and four deletion events in otherwise so- 
matically mutated V H gene cDNA molecules. Two of these insertions and all four deletions 
were obtained through the sequencing of 395 cDNA clones M 10.000 nucleotides) from 
CD38 + IgD" germinal center, and CD38~igD~ memory B cell populations from a single hu- 
man tonsil. No germline genes thai could have encoded these six cDNA clones were found af- 
ter an extensive criaracterlzatlon of the genomic V H 4 repertoire of the tonsil donor. These six 
insertions or deletions and three additional insertion events isolated from other sources oc- 
curred as triplets or multiples thereof, leaving the transcripts in frame, Additionally. 8 of 9 of 
these events occurred in the CDR1 or CDR2, following a pattern consistent with selection, 
and making it unlikely thai these events were artifacts of the experimental system. The lack of 
similar Instances in unmutated IgD*CD38" follicular mantle cDNA clones statistically associ- 
ates these events to the somatic hypermutation process (P = 0.014). Close scrutiny of the 9 in- 
sertion/deletion events reported here, and of 25 additional insertions or deletions collected from 
the literature, suggest that secondary structural elements in the DNA sequences capable of pro- 
ducing loop intermediates may be a prerequisite in most instances. Furthermore, these events 
most frequently involve sequence motifs resembling known Intrinsic hotspots of somatic hy- 
permutation. These insertion/deletion events are consistent with models of somatic hypermu- 
tation involving an unstable polymerase enzyme complex lacking proofreading capabilities, and 
suggest a downregulatlon or alteration of DNA repair at the V locus during the hypermutation 



L^sporL. B cells hone the specificity of their antibody 
molecules through a process of random somatic hypermu- 
tation of their V genes, followed by antigen driven selec- 
tion. This is collectively referred to as affinity maturation. 
Ttiis process occurs within the germinal centers (GCs) 1 or 
secondary follicles from peripheral lymphoid organs when 




antigen stimulated B cells receive proper signals from T and 
accessory cells. In the human system, GC B cells are char- 
acterized by the surface expression of CD38 and, in most 
cases, the loss or IgD (1-3). We have previously shown that 
the initiation of somatic hypermutation occurs within the 
CD77 + subset of these lgD-CD38 4 B cells (4) Mutated V 
genes can be isolated from all subsequent stages of B cell 
differentiation and in cells from all IgD" and certain IgD* 
B cell subsets (4. 5). The molecular process of somatic hy- 
permutation remains elusive, primarily due to the lack of a 
good In vitro model until very recently (6). Much of what 
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EXHIBIT C 



s known concerns: (a) localizing the somatic hypermuta- 

ings (4, 7-10); (b) delineating the limits and rates of muta- 
ional activity (II), (t) determining the minimal substrate 
hrough transgenic technology (12, 13); and (d) analyzing 



ences 12 and 13). 

Although somatic hypermutation is typically described as 
the generation of bp substitutions, insertions and deletions 
have been sporadically described. As with somatic point 
mutations, the analysis of these events can provide valuable 
information concerning somatic hyperrnutauon itself. Anal- 
ysis of human V H 4 family genes generated from the ampli- 
fication of cDNA from somatically mutated GC (IgD' 
CD38') and memory (IgD CD38 ) B cell subpopulaiions 
led us to identify a number of cDNA clones from the mu- 
tated cell populations that contained Insertions and dele- 
tions. We provide evidence that these events are linked to 
the somatic hypermutation process. Additionally, these 
events occur in a predictable fashion relative to the sur- 
rounding sequence, suggesting a model for their occur- 
rence with implications for the molecular process of so- 
matic hypermutation. 



Materials and Methods 

Isolation. Labeling, and Sorting of Tonsil B Celts. Human tonsils 
were obtained during routine tonsillectomy. B cell Isolation and 
sorting for CD38 and IgD expression were performed as previ- 
ously described (4. 14). In brief, human tonsillar B cells were sep- 
arated into lgD'CD38- follicular mantle (FM) 8 cells igD" 
CD38* GC B cells, and [gD-CD36- memory B cells to 95-98% 
purity as predicted by FACS* analysis, as previously described (13) 
The mutation state of the V H gene cDNA clones from the var.ous 
subpopulatlons was in agreement with our previous study (4). 
Clones were considered somatically mutated If tliey contained two 
or more bp substitutions, well beyond the expected error rates for 
the avian myeloblastosis virus reverse transcriptase (AMV-RT). 
Taq, and PFU polymerases used in these analyses (this mutation 
rate Is based on our previous analyses; reference 4). 

Sequencing ihe Ig V H Transcripts. Total RNA was extracted 
from 1-5 X 10 s B cells using guanidlnlum thiocyanate- phenol- 
chloroform In a single step using the Ultraspec RNA Isolation 
system (BIOTECX Laboratories. Houston. TX). and was reverse 

oligonucleotides 8 Cp.12 (5^CTCCACT^VcACACCAc' 
CTC-3') for IgM transcripts or C7I8O (5'-CTGCTGACC- 
CACTAGACTCC-3) for IgC transcripts and SuperScrlpt II reverse 
transcriptase (GIBCO BRL. Calihersburg. MD). First strand cDNA 

PCR using internal primers corresponding to the Cp, or C7 con- 
stant regions in combination with V H 4 or V H 6 family-specific 
leader oligonucleotides: C7MO. 5'-CCCAACCTCTCCACGCC- 
GCTC-3'. Cu.10. 5'-TCTGTCCC CTGCATCACGTC-3' L-4 
5 ' - ATC AAAC ACCTGTGGTTCTT- 3 ' , L-6, 5'-ATCTCTGT- 
CTCCTTCCTCAT 3' The PCR products were purified using 
microconcentrators (Amlcon. Beverly. MA), and then were ki- 
nased and blunt-end llgated into an EcoRV-dlgested and dephos- 



phorylated pBluescript plasmld (Slratagene. La Jolla. CA; Poiynu 
cleotlde Kinase, T4 DNA Llgase, and EcoRV were from Boehrlnge 
Mannheim, Amsterdam, Netherlands). After transformation 
electroporation into electro-competent DHlOa Escherichia 
(CIBCO BRL) and screening with consensus Internal ollgo 

ck. t.de as previcu Is des. need II IS) p, ,„ e 

and automated sequencer proiocc^ (ABI-377: Advanced Blote 
nologies Inc.. Columbia, MD). All sequences were analyzed 
mg DNAstar IDNAstar Inc. Madison. WI). In the first tonsil 
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V H 4 complementarity -determining region (CDR)l-speclfic probes 
were sequenced (395 or 583 clones), thus enriching the somati- 
cally mutated populations analyzed. In that the CDR1 probes 

The frequency of the occurrence of these events can therefore 
only be predicted to be between 6 out of 395 and 6 oui of 583 
clones (1-2%), Any sequence of Interest was resequenced In both 
directions 10 ensure sequence fidelity. 

Characterizing the Genomic Repenolre. Total genomic DNA was 
Isolated from FM B cells (IgD*, CD38-) using the Puregene DNA 
Isolation kit (Centra Systems. Inc., Minneapolis. MN). V H 4 genes 
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Clones identified in the cDNA analysis tl 

amplify both the exact sequence of clones with Insertions/dele- 
tions as found and the predicted sequences based on the proposed 
germllne counterparts. Oligonucleotides used in this analysis 
(Format. Is as follows: clone: exact/predicted): gG4:5'-CCACCC- 
CTTGTACTTGGTTCC-375'-CCACCGGTTGTACCTC- 
TCC-3'; gl44:5'-TCTTCAGCCACCCGTTGCTGT-375'- 
TCTTGACCGACGGGTTGT-3': gJ87:5'-CAGCTCCAGTAG- 
TAACCCCCC-375' CACCTCCACTACTAACCACCC; Pise- 
s' -GACGGATTGTAGTTGCAGCC-3'/5'-CAGGCCTTGT- 
ACTTCCTCCC; gl92:5'-CCACCCCCACTACTACTAACT- 
37(same); and g80:5'-CCCGATCCAATACCTCACACT-37 
5'-CCGGATCCACTAGTAACC-3'. 

Sequence Availability. All cDNA sequences with insertions or 
deletions, acid any genomic sequences unique to the literature as de- 
scribed In Die results section are available from EMBL/Cenbank/ 
DDBJ under accession numbers AF013615 through AF013626. 

Assay lor Srreenlng V H Cent Lengths. To facilitate the analysis 
of large numbers of V H gene transcripts for the presence of Inser- 
tions or deletions, first strand cDNA produced as described above 
was PCR amplified using Expand high fidelity polymerase (Boeh- 
nnger Mannheim) to reduce errors resulting from Taq poly- 
merase alone. The products of this PCR amplification were cloned 
as described above and screened using ^P-labeled, gene-specific oli- 



used to directly inoculate 25-u.l PCR amplification mixtures in 
96-well-rormat PCRs. The internal PCR reactions used J2 P-labeled, 
gene-speclflc oligonucleotides to amplify a 230-base fragment in- 
cluding the V H 4-39 CDRI (L-4, as above, and V H 4-39-3': 5'- 
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GCTCCCACTATAATAGATACT-3') or for analysis of V„6 
genes a 166-nucleoiide fragment Including ihe CDRI and CDR2 
of V H 6 (V H 6FWl: 5'-TCCCATCTCCCCCCACAGTCT-3'. 
V H 6FW3: 5'-TCTCTCTGCGTTCATCCTTAT-3'). Allquots 
of each clone were also used to inoculate amplifications of CDR3 
regions using FW3-speclflc (ssFW3: 5'-CTCAA[C/C]CT- 
CACCTCTCTGACIT/CI) and C|i- or C-y-speclflc oligonucle- 
otides (Cp.D: 5' -GGAATTCTCACAGGAGACGA-3'. C7-HO 
as above) to analyze the diversity of the populations under study: 

sequences cloned in (his analysis were used to produce an ex- 
pected distribution of CDR3 sizes for comparison (see Fig- 5 B). 
The amplification products were electrophoreses on 0.6X-TBE. 
5% urea-acrylamide sequencing gels (Long Ranger: J.T. Baker. 
Phillipsburg. NJ) and analyzed with a Phosphorlmager (Molecu- 
lar Dynamics, Sunnyvale. CA.) using the (mage Quant software 
supplied by ihe manufacturer. Clones that differed from the ex- 
pected size and ttiose clones In lanes adjacent to aberrantly mi- 
grated bands were used to produce plasmtd preparations from 
which ihe inserts were sequenced in either direction. 

Storing of Insertion/ Deletion Evenls. (n the results section. In- 
sertions and deletions are scored as evenls per ICf nucleotides 
within the customary boundaries of CDRI and CDR2. This unit 

are generally only found in the CDR regions and therefore Ihe 

In the PACE analysis. ^ach V H 4-39 FM clone included only ihe 
CDRI (21 nucleotides) within a total of 230 nucleotides/clone. 

eluded both theCDRI and CDR2 ^CDR nucleotides). In the 
sequencing analysis, various B cell populations were analyzed in- 
volving a wide range of overall lengths. Comparisons of the fre- 
quency of insertions/deletions just within the CDRs allowed for 
a more standardized and quantitative analysis, and for more free- 

Bacuksvlrus Expression System. Cloning and coexpression of 
clone pg86 and k light chain FS6k in the baculovirus expression 
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Figure 1. Predicted amino 
acid sequences of nine cDNA 




„ble f>om CenBWDDBJ under 
accession numbers A F0 1 36 15 
through AF01 3626 



system was performed as previously described (17). Recombinant 
Autographs California nuclear polyhedrosis v.rus (AcMNPV) was 
cloned using the pH360NX transfer vector and expressed in Sf9 

Capture EUSA for y Heavy Chain, and k Light Chains. Expres- 
sion of lecombiiianl antibodies ,,f lone pg86 coexpicssed with K 
light chain FS6k were measured by capiure EUSA. Wells were 
coaled with goal anti-human IgL. and titubated with supematanl 
of recombinant pg86/FS6x added in serial twofold dilutiuns. 
Bound antibody was detected using alkaline phosphatase-conju- 

bance was measured at 405 nm In an EUSA plate reader. 



Results 

Insertions and Deletions into Immunoglobulin V H Cents. 
In a large scale analysis of V H genes from both the IgM and 
IgG compartments of B cell subpopulallons separated from 
a single human tonsil, six clones that contained DNA inser- 
tions were apparently selected in that they involved nucle- 
otide triplets or multiples of nucleotide triplets, leaving the 
cDNAs (transcripts) In frame, and they were localized to the 
CDRI and CDR2 (Fig. 1, A and Bj. The six clones with 
insertions or deletions were identified from the sequencing 
or 395 cDNA clones M 10.000 nucleotides) from GC and 

<2% of clones analyzed (~1 event/ 18.000 nucleotides). 
All six evenls were In IgG transcripts. Two events were ob- 
tained from lgD-CD38^ GC and four events from 
!gD~CD38~ memory cell populations. None of the IgM V H 
cDNAs analyzed from this tonsil had insertions or deletions, 
although we have observed such events In IgM transcripts 
in the past and in subsequent analyses, as described below. 




genes. The primary variability between V„4 tamily members is 3-6-bp 
(Heltons lhac we attribute to somatic hypei muldiwi In the selected B cell 
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slightly belwee 


individuals (16. 18). As shown in Fig. 2. 


the major differ 


nee between V H 4 genes involves the length 


ofCDRl. Beca 


use genomic diversity between V H 4 family 








possible alternative explanations for these 




a) different alleles of the detected genes; (6) 


rarely expressed 


or otherwise unknown V H 4 gene family 






aer artifacts of the experimental system. To 


address these iss 


es, both the expressed and genomic reper- 






Table \ °Zomo 


1 18 V H 4-39. 2 out of 49 V H 4-31 , 1 out of 


87 V H 4-34.and 


out of 45 V H 4-59 cDNA clones contained 


lnsertlon/deletic 


n events. cDNA clones were Judged as 


unique isolates t 


ased on CDR3 analysis, and the few Iso- 




ed to be clonally related differed in their 



patterns of somatic mutation beyond the level explainable 
by reverse transcription and PCR errors (maximum: >l 
mutation/500 nucleotides of V H gene sequence as prevl- 
ously described |4|). 

To characterize the genomic repertoire of the Initial ton- 
sil. 80 germline V H 4 gene clones were isolated and se- 
quenced (Table 1), which encompassed all 14 V H 4 family 
members or alternate alleles represented in the 446 cDNA 
clones analyzed from all of the tonsillar B cell subsets. In 

two apparently functional V H 4 genes not found as cDNA 

tlonal V H 4 gene and a divergent polymorphism of a known 
V H 4 pseudogene. The proposed germline counterparts of 
each of the V H 4 genes containing insertion/deletion events 

genomic isolates of V H 4-31 and of V H 4-39 were cloned. 
V H 4-34 and V H 4-59 were isolated 1 1 and 4 limes, respec- 
tively. No germline genes were isolated that could have en- 
coded the Insertion/deletion events described. 



Table 1 . cDNA and Germline Clones Isolated 







cDNA 


Total Germline 


alleles isolated" 


c 


ones with cDNA clones clones 
ins/del isolated isolated' 


V H 4-39 




2 


113 7 


V H 4-3I 




2 


49 8 


V„4-59 






45 4 


V H 4-34 






87 11 


V H 4-34 related 




0 


0 4 


V H 4-55 pseudo 






0 12 








0 3 


V H 4-04 




0 


17 7 


V H 4-04-related 


pseudogene 5 


0 


0 2 


V H 4-6I 




0 


25 7 


New V H 4 gene 1 




0 


33 3 


V H 4-04B 




0 


72 1 


V„4-28 




0 


0 1 


■Nomenclature b. 
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3 PCR artifact 










nserllon or deletion 










HVlldogCI i L 




or frameshlfl m 


ulatlons and are not 






l closely rc-Litc 


to V H 4-04. 



described herein were not germline encoded, two sets of 
PCR primers were designed to specifically recognize: (a) 
the exact sequence of the events; (b) the predicted, unmu- 
lated, germline sequence corresponding to the cDNAs con- 
used to amplify genomic DNA from this individual, yield- 
ing negative results (data not shown). The unique nature of 
these events relative to both the expressed and genomic 
repertoire and our Inability to amplify genomic counter- 
parts for these events by PCR suggest that they are not germ- 

The Proposed Insertion/Deletion Events Are Not the Result of 
(Vh/VuJ Recombination. As in most V gene repertoire 

the result of either PCR splicing by overlap extension arti- 
facts, or reciprocal homologous recombination between un- 
rearranged V genes (19). However, none of these likely artl- 

the insertion or deletion of DNA described above. A num- 
ber of artifacts of this type had been isolated in the cDNA 
analysis as well; such artifacts are common to V gene analy- 
ses (20), The cDNA isolates with deletion and insertion 

cDNA isolates and were found to be unique relative to both 
the expressed and germline V H 4 gene repertoires of this In- 

The Insertions and Deletions Are Associated with Somatic Hy- 




ns the ho(-PCR products CP-labeled) of 
an individual clone, (ffl A comparison < 
es of the 485 CDR3s assayed to the dijl 

ctrophoresls were a polyclonal population. CDR3 slz 



and the CDR3 
CDR3 
500 CDR3s OS- 



S' Cp. or C-, r, 

l assayed, The . 
st CDR3 fount 



CDR3 le, U !ll« lu, Ih- 



analyze 



s. This w 



ng either direct sequencing or 
ns of the V H genes spanning 
e comparisons on polyacryl- 
ss that ran aberrantly, and the 
sequenced (75 out of the 485 



PCR amplification of p 
the CDRs, followed b 
amide gels (Fig. 3). Any 
clor.es in adjacent lanes, 
clones). None of these 
CDR3 homology. To ensure that the remaining 410 FM 
clones were polyclonal, the CDR3s were PCR amplified 

V H gene amplification products for size comparisons (Fig. 
3 A). The size distribution of these CDR3s was similar to 
that of ~500 V H gene sequences analyzed In this study 
(Fig. 3 B). providing evidence that our FM sample is poly- 



The si. 



i froi 



from 395 mutated cDNA clones (25,482 CDR nucleo- 
tides), corresponding to a frequency of 2.35 events/10' 1 CDR 
nucleotides. This is significantly different (p=0.014 by a 
one-sided x 2 test) from the analysis of unmutated FM- 
derived clones (25.515 CDR nucleotides) that yielded no 
insertions or deletions (Table 2). 

In the course of the analysis described above, we isolated 
one IgM clone containing a 6-nucleotide insertion into 
framework (FW)3 (see below). We believe that this clone 
is part of the mutated GC or memory repertoire because it 
contained 4 bp substitutions In addition to the insertion. In 
this study, (he B cell populations analyzed were 95-98% 
pure, and the FM B cell subpopulatlon could therefore in- 
clude between 2 and 5% contaminating clones, that is, 
IgM-expressing cells not from the naive population that 
can therefore be somatically mutated. However, none of 



Table 2. Analysis of Unmulated FM cDNA Clones for Insertion or Deletion Events 



vM-FM.CDRl' 

/ H 6 IgM FM V H genes* 

y H 4 family FM sequences 



5.565 
16,500 

3,450 
25.515 



'Everm per I0 4 CDR nucleotides. 
'Expected frequency (events/10' CDR nu 
otldes/10 J ) = 2 35. 

'St.imiu-,,! »iu| v ,» x'' lest tor independent 



)f the CDR I and CDR2. (See Maler.ab and Methods for a more detail 




•nee data available from EMBL/GenBank/DDBJ under ac- 
rrs AF0I3GI5 ihrougli AP0I3626. 



d FM do, 



analyzed 



>r dele- 



Otha Insertions and Deletions into V H Genes. We have ob- 
served similar instances of insertions and deletions into the 
coding regions of apparently functional Immunoglobulin V 
genes. Including: (a) a V H 6 IgM Isolate containing a triplet 
duplication/insertion into the CDR1 in addition to several 
bp substitutions (Figs. 1 C and 4 A), which was derived 
from a human hybridoma secreting high affinity mAb 
against Bordciella pertussis (21. 22); (b) a 6-nucleotlde inser- 
tion into the FW3 region of a mutated IgM V H 6 gene, rep- 

the CDRs (Figs. 1 ) Cand 4 A. clone tml2l); and w'an ^6- 
nucleollde duplii al ,on' 'nisei lion ii no a human plasma tell 
cDNA transcript at the boundary between the FWl and 
CDRI (Figs. I Cand 4 Q. doubling the length of this hy- 
pervariable loop. The viability of clone pg86 was tested by 
expressing it in the baculovirus system in association with a 
K light chain encoding construct (FS-6k; Fig. 5). The effi- 
cient expression, secretion, and pairing with light chain In 
I fie baculovirus system suggest that the product of clone 
pg86 is a functional heavy chain despite the large duplica- 



The Insertions and Deletions Are Related to 
Sequence. As shown in Fig. 4, Ihe insertions reported 
duplications of the Immediately adjacent sequence, and 



id tntf 



deletions involve elements of repetitive tracts, In addilion, 
a higher incidence of ihese events involve sequence motifs 
thai resemble intrinsic hotspois of somalic hypermutation 
(12, 23-27): (a) four of eight events Involved ihe serine 
codon AGC that has been reported as the "hottest" of 
hotspots (24- 27) (Fig. 4. sequences HBp2, g)87. g]88. and 
g86); (i) two events involved TAC motifs (Fig. 4. gl92 and 
g64): and (c) two events involved the motif AAC (Fig. I. 
g!44. and tml2l). In general all of the clones found to 
contain insertions and deletions were highly mutated (Fig. 
1). Several of these clones had bp substitutions clustered 
with the insertions or deletions (Figs. I and 4). The plasma 
cell iranscript depicted in Fig. 4 C contained an 18-nucle- 
otide Insertion thai duplicated the 5' adjacent sequence. 
The central nine nucleotides of ihe duplicated sequence 
form a partial palindrome (..GGtCaCtCC). This clone 
was mutated (C to A at position 80 and an A to T at posi- 
tion 85) before the duplication/insertion event, as these 



mutations w 



■d seque 



Somatic modification of V genes encoding immunoglob- 
ulin and T cell receptors recapitulates most mechanisms 
observed In the evolutionary diversification of DNA: (a) V 
gene recombination, including imprecise junctions, P nu- 
cleotides, and untemplaled N nucleotide addition: (i>) gene 
conversion: and (c) bp substitutions in Ig somalic hypermu- 
tation. The insertion and deletion of nucleotides is another 
means lor the evolutionary diversification of DNA. and has 
been proposed as an explanation for unusual V gene se- 
quences In che past (Table 3). In this study, we show that 
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^ Complexities of the Analysis of Insertions and Deletions into V 

been a daunting task because of their low frequency, and the 
complexity of the germline V H repertoire. According to 
our study, these events occur in <2% of somatically mu- 
tated clones. As shown In Fig. 2, the primary variability be- 

tionary diversity and somatic diversification was expected, as 
the molecules are likely subject to the same functional and 
structural constraints. This has made it difficult to determine 
whether these events were generated somatically, versus germ- 
line encoded, or if they were artifacts of the experimental 

In fact, V H genes may exhibit particularly unstable se- 
quence characteristics evolved to help support both germ- 
suggested by the identification of intrinsic hotspots of so- 
matic hypermutation within the CDRs of V genes (25. 26). 
Perhaps the area of greatest contention In thts complex sys- 

the AMV-RT. Taq. cJVu polymerases. md/oTlh7cW>n- 

ing in E. coli. 

The Insertion/Deletion Events Are The Result of the Somatic 
Hypermutation Process. Our syslem addresses several key 
issues that associate the occurrence of Insertions and dele- 
tions to the somatic hypermutation process, (a) Six of the 
nine insertions/deletions were identified within the V H 4 

tal system that could be characterized extensively as de- 
scribed below, (fc) All of the Insertion/deletion events re- 
ported involved triplets or multiples of triplets, leaving the 





were localized to the CDRs. As with 




ions, no insertions or deletions were 




o 120 nucleotides Of constant region 


(Cp, or C^DNa' 


sequenced with each cDNA clone. 


These hallmarks or so 


matic hypermutation and selection ar- 


gue strongly that tht 


se events are not artifacts, (c) The B 


cells analyzed were 




pure, mutated B Cf 


til populations including CC (IgD" 


CD38*) and memory 


(IgD-CD38-) B cells, and an unmu- 


tated FM B cell popu 


lation (I g D-CD38-). making it possl- 



sertion and deletions to the somatic hypermutation process 
(P = 0 014). In addition, the isolation of four of the Inser- 
tion/deletion events from me mory B cells provides evi- 
dence that these events did not result from artifacts related 
to contamination from endonucleoiytically cleaved DNA 
from the apoptotic CC cells, id) Seven of nine events re- 
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ported in this study involved y heavy chains that contain 
nearly twice the mutations of p. heavy chains (4), further 




previously described hotspots of somatic hypermutation. pro- 
viding evidence that these events occur by the same pro- 
cess. (I) Finally, we extensively analyzed the V H 4 gene fam- 



ily of the tonsil donor at both the expressed and genomic 
levels, facilitating the assignment of the insertions/deletions 

with insertions and deletions were unique among 395 V H 4 
cDNA clones sequenced from a single tonsil, including many 
independent isolates of each of the V H 4 genes expressed 
(Table 1). In addition, we were unable to isolate genomic 

by PCR or through the extensive characterization of the 
genomic V H 4 repertoire of the tonsil donor (Table 1). Tern- 
plating of these events from any other V H gene family can 

families differ significantly in the CDR sequences where 

Structural and Functional Considerations of Insertions and De- 
letions into V H Cenes. The events involving the insertion 
or deletion of a single amino acid from the CDR1 or 
CDR2 would not be expected to profoundly alter the back- 
bone structure of these molecules, as the CDRs are the 
most malleable portions of antibodies. The clone g80 has 
two of the five amino acids that are customarily considered 
its CDR 1 deleted, leaving only three amino acids to form 
this hypervarlable loop (Fig. 1 B). Thus, this is one of the 
shortest CDRls reported to date. The clone tml21 has 
two amino acids inserted into the FW3 region. The por- 
tion of the FW3 where this insertion occurred is believed 
to be solvent exposed and corresponds to the region where 
the B cell superantigen staphylococcal protein A binds to 
most V H 3-encoded Ig molecules (28); therefore, it is likely 
that the Insertion into this V H 6 clone can be tolerated as a 
loop or bulge on the molecule's surface. The most complex 
structural change observed In our study Involved clone 
pg86. with a six amino acid insertion at the FW1/CDRI 
junction that would presumably double the length of this 
hypervariable loop and require dramatic structural accom- 
modation. However, we were able to express this heavy 
chain and found It paired with light chain, Indicating that it 
is likely functional (Fig. 5). The clone HBp2, containing a 
triplet insert into its CDR1. is particularly interesting be- 
cause it has a known specificity. This V H 6 gene was Isolated 
from a human B cell hybridoma with and- Bordetella pertus- 
sis specificity (21, 22) Clone HBp2 has also been expressed 
In the baculovirus system and is fully functional. We are 
currendy performing mutational analysis of this heavy chain 

plays a role in the affinity and/or specificity or this antl- 

Analysis of Insertions and Deletions Reported in the Litera- 
ture. Various groups have reported a number of insertion 
and deletion events (Table 3). Virtually all of the insertions 
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5enger transgenes. 19 out of 25 insertions or deletions Into 
somatically mutated genes involved predominantly repeti- 
tive elements, or in several cases other sequence patterns as- 
sociated with secondary structures such as Internal homolo- 
gies or inverted repeats (Table 3). With the inclusion of the 
9 events described in this work, 28 out of 34 Insertions and 
deletions Involved such elements. Thus, the proximity of 
sequence elements that can be predicted to cause secondary 
structural changes In the DNA seems to be a hallmark of 
insertions and deletions into somatically mutated V H genes. 

A Model for the Occurrence of Insertions and Deletions during 
Somaiic Hypermutation. The evidence for the involvement 
of DNA secondary structure in the production of insertion 
or deletion mutations during somatic hypermutation, as sug- 
gested in 1986 by Colding et al. (29). now seems unequiv- 
ocal. The insertions and deletions described in our study, 
and those illustrated In Table 3. occur in a predictable fash- 
ion, involving sequence motifs that could form loop inter- 
mediates reminiscent or the replication slippage model or 
Strelsinger et al. (30) and Ripley and Gltckman (for review 
see 31) as presented in Fig. 6. Such mutations are postu- 
lated to occur when DNA polymerase slips or stutters and 
the newly synthesized strand shifts on the template and re- 
anneals to an adjacent repetitive element, producing un- 
paired loop intermediates localized to one or the other 
strands, if this unpaired loop Intermediate is not repaired 
then it will be perpetuated as an insertion or an instance or 

A Possible Correlation to Intrinsic Holspots. A higher fre- 
quency or somatic hypennutatlon has been reported to oc- 
cur at sequence motifs referred to as intrinsic holspots (for 
review see reference 12). Interestingly, every insertion/de- 
letion event reported in our study resembled one of these 
hotspots (AGC. TAC. and AAC: references ]2 and 27; Fig. 



4). The analysis of selected populations may have influ- 
enced this tendency because seven out of cighi of these 
events occurred in the CDRs where it has been shown thai 
hotspot motifs are preferentially found (25, 26). Further 
more, only a weak correlation to hotspots could be found 
for the previously reported insertions/deletions involving 
unsclected regions or V loci (Table 3). However, the single 
event found in this analysis that occurred outside of the 
CDRs in FW3 (clone lml21. Figs. 1 C and 4 A), also in- 
volved a tandem of possible hotspots (AAG. AAC). A mote 
extensive and directed analysis Is required to fully address 
this Issue. 

Implications for lite Molecular Meclianism of Somaiic Hyper- 
mutation. The Instability of repetitive tracts during DNA 
replication is a hallmark of defects in poslreplieative mis- 
match repair (33). and the locus specific downregulatlon of 
DNA mismatch repair in response to UV irradiation has re- 
cently been reported for Immunoglobulin V H genes In 
freshly sorted GC B cells (CD38'IgD") compared to man- 
tle zone B cells (CD38-|gD\- reference 34). In a recent 
study by Tran et al. (35). it was shown that tract instability 
of homonucleotirie runs associated with mismatch repair 
delects occur more frequently in long than in short runs. 
These authors suggested that ir loop Intermediates occur in 
long repetitive tracts (>8 bp for a homonucleolide run) 
they could Involve a distal repetitive element out of reach 
of the polymerase proofreading activity and only be sub- 
jected to mismatch repair. However, for short repetitive 
tracts, as for the events reported in this analysis, loop inter- 
mediates can only occur proximal to the polymerase com- 
plex and are therefore subjected to both polymerase proof- 
reading and mismatch repair mechanisms. 

All 9 events In this analysis, and 19 out of 25 events from 
the literature (28 out of 34 Insertions and deletions re- 
ported), appeared to result horn secondary structural inter- 
mediates. Loop Intermediates proximal to the polymerase 
complex during DNA polymerization should be repaired 
by the polymerase proofreading mechanisms immediately, 
or by the postreplicative DNA repair systems This analysis 
suggests the following characteristics for the polymerization 
process during somatic hypermutation. (a) The polymera.se 
interacts with the V locus in a particularly unstable or 
"loose" rashlon, especially when hotspot motife or elements 
capable of forming secondary structures are encountered, 
allowing bp substitutions in most Instances, and Insertions 
or deletions via polymerase slippage at a much lower fre- 
quency, (6) It has limited proofreading capabilities; and (r) 
there is a downregulatlon of postreplicative mismatch re- 
pair. An efficient means to downregulate mismatch repair 
during somaiic hypermutation could be through the lack of 
differentiation of the template and progeny strands for the 
mismatch repair system; lack or strand differentiation has 
been shown to increase the rate of mutations Introduced 
(36). Such a system would be advantageous for trie locus- 
speclflc V gene somatic hypermutation in that it could In- 
volve alterations or a single enzymatic complex (polymerase 
complex) rather than multiple systems (proofreading and 
mismatch repair). Another system, which would have the 
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same advantage, I.e.. the alteration of a single complex, 
would be the alteration of a DNA repair system such as 

tion of postreplicative mismatch repair at the V locus in the 
rapidly proliferating centroblasts that are undergoing so- 
matic hypermutatlon or due to a polymerase enzyme with 
such a high fault rate as to overwhelm any repair. 

All currently accepted models of somatic hypermulation. 
whether related to DNA excislon-repalr-llke systems or 
transcription-repair, or to DNA polymerization or reverse 

ris-factors in the V locus (enhancers etc ) folk d by tl 
activity of unknown polymerase enzymes of some type. 
This analysis does not refute or corroborate any of these 
models directly, but it does provide further characterization 
of the polymerization system involved, based on the types 



of mutations observed and on the molecular biology that Is 
known to cause such mutations. This analysis and the 
model presented here provide further Information or crite- 
ria to be contemplated as the various possible polymerase 
systems involved are considered 

Conclusions. Insertions and deletions into immunoglob- 
ulin V H genes during somatic hypermutatlon are additional 
means by which the Immunoglobulin repertoire can be di- 
versified. These events display characteristics supporting mod- 
els of somatic hypermutation involving a particularly unsta- 
ble or error-prone polymerase to allow the introduction of 
mutations, and involving the downregublion of DNA repair 
to allow the perpetuation of these mutations. Additionally, 
we show thai these events tend to Involve sequence motifs 
resembling intrinsic hotspots of somatic hypermutatlon, sug- 
gesting thai the polymerase complex is destabilized in a se- 
quence-specific manner to allow preferential mutation at these 
sequence elements. 
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